Using semi-structured data for assessing research paper similarity

Authors

  • Germán Hurtado Martín
  • Steven Schockaert
  • Chris Cornelis
  • Helga Naessens
Abstract

Article history: Received 8 December 2011; received in revised form 24 May 2012; accepted 26 September 2012; available online 6 October 2012.

Similar resources

Trusting Semi-structured Web Data

The growth of the Web brings an uncountable amount of useful information to everybody who can access it. These data are often crowdsourced or provided by heterogeneous or unknown sources; therefore, they might be maliciously manipulated or unreliable. Moreover, because of their amount, it is often impossible to check them extensively, which gives rise to massive and ever-growing trust issues. T...

Fingerprinting of some Egyptian rice genotypes using Intron-exon Splice Junctions (ISJ) markers

DNA fingerprinting has become an important tool for diversity assessment and varietal identification in plant breeding programs. Semi-random PCR primers targeting intron-exon splice junctions (ISJ) were used to evaluate the potential of these markers in identification and classification of rice genotypes. A total of 12 ISJ primers were used for screening fourteen Egyptian rice genotypes, inclu...

Very Fast Similarity Queries on Semi-Structured Data from the Web

In this paper, we propose a single low-dimensional representation for entities found in different datasets on the web. Our proposed PIC-D embeddings can represent large D-partite graphs using a small number of dimensions, enabling fast similarity queries. Our experiments show that this representation can be constructed in a small amount of time (linear in the number of dimensions). We demonstrate how it...

Similarity and Analogy over Application Domains

Databases, particularly when storing heterogeneous, sparse semistructured data, tend to provide incomplete information and information which is difficult to categorize. This paper first considers how to classify entity instances as members of entity classes organized in a lattice-like generalization/specialization hierarchy. Then, it describes how the frame representation employed for instances...

A Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model

Trees are capable of portraying the semi-structured data that is common in the web domain. Finding similarities between trees is mandatory for several applications that deal with semi-structured data. Existing similarity methods examine a pair of trees by comparing the nodes and paths of the two trees to find the similarity between them. However, these methods provide unfavorable results for uno...

Journal title:
  • Inf. Sci.

Volume 221  Issue 

Pages  -

Publication date 2013